Spoken-style explanation generator for Japanese kanji using a text-to-speech system

نویسندگان

Yoshifumi Ooyama

Hisako Asano

Koji Matsuoka

چکیده

In this paper we describe a spoken explanation generator, PLANET, for Japanese Kanji (ideograms), especially Kanji used in people's names. A number of text-to-speech systems for Kanji texts have been proposed but this is the rst one that can explain Kanji characters so as to disambiguate characters from many homophone Kanji candidates. To accomplish this the generator explains the Kanji by using both internal composition and use in other words. The system has a database of over 6,000 Kanji characters that breaks down any given Kanji into its components and a text corpus of explanations for explaining two or more Kanji characters. It is capable of generating both other words that include the Kanji characters in question, and identifying information. Using these other words and the information the system makes phrases and sentences. Furthermore, this system generates natural prosodic information by classifying the pattern of semantic connections between words and phrases. The explanations are output through a natural-sounding voice synthesizer. Hearing examinations con rmed that this system achieves high accuracy in disambiguating Kanji characters from among many candidates. This system will make it possible to provide advanced and user-friendly human-computer interfaces.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sentence-Style Conversion of Japanese News Article for Text-to-Speech Application

This paper proposes a method of sentence-style conversion for generating spontaneous Japanese speech in a text-to-speech synthesis system. Since written language is different from spoken language linguistically, the speech by direct reading of written Japanese texts might be unnatural. The method takes a fully rulebased approach to convert the sentence style and to complement sentences, which a...

متن کامل

Parallel Speech Corpora of Japanese Dialects

Clean speech data is necessary for spoken language processing, however, there is no public Japanese dialect corpus collected for speech processing. Parallel speech corpora of dialect are also important because real dialect affects each other, however, the existing data only includes noisy speech data of dialects and their translation in common language. In this paper, we collected parallel spee...

متن کامل

Adding Japanese language synthesis support to the eSpeak system

In this paper we describe an addition to the eSpeak system which is capable of pronouncing Japanese language. This implementation is used for automatic segmentation of Japanese speech from Japanese text. The speech synthesiser that we use is part of the praat speech analyse program and is based on the eSpeak text-to-speech engine. Because the Japanese writing system is very complex i.e. it mixe...

متن کامل

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

Recent Topics in Speech Recognition Research at NTT Laboratories

This paper introduces three recent topics in speech recognition research at NTT (Nippon Telegraph and Telephone) Human Interface Laboratories. The first topic is a new HMM (hidden Markov model) technique that uses VQ-code bigrams to constrain the output probability distribution of the model according to the VQ-codes of previons frames. The output probability distribution changes depending on th...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1996

Spoken-style explanation generator for Japanese kanji using a text-to-speech system

نویسندگان

چکیده

منابع مشابه

Sentence-Style Conversion of Japanese News Article for Text-to-Speech Application

Parallel Speech Corpora of Japanese Dialects

Adding Japanese language synthesis support to the eSpeak system

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Recent Topics in Speech Recognition Research at NTT Laboratories

عنوان ژورنال:

اشتراک گذاری